Development of Filipino Phonetically - Balanced Words and Test Using Hidden Markov Model

نویسندگان

Arnel C. Fajardo

Yoon-joong Kim

چکیده

In this paper, two sets of phonetically balanced words (PBW) in Filipino were developed; namely the 2syllable, and 3-syllable PBW list. These are tested as a speech corpus in a word-level recognizer using the Hidden Markov Model (HMM) as a framework and Mel-Frequency Cepstral Coefficient (MFCC) as a feature extraction technique. Thus, this study is a preparation for a Largecorpus Filipino Language ASR using HMM. For the testing of the PBW sets, fifty speakers were trained (25 male and 25 female speakers). For the recognition of the 2-syllable word list, an average accuracy rate of 93.25% and 88.67% were achieved for the speaker dependent and speaker independent tests, respectively. For the recognition of the 3syllable word list, the recognizer achieved an accuracy rate of 99.53% and 96.30% for the speaker dependent and speaker independent tests, respectively.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Development of a Real-time Asr System for Slovak Speechdat Database

This paper describes development of a real-time speech recognition system in Slovak for the voice-operated telephone services. The system is based on SPHINX2 platform. The decoder using Hidden Markov Models was trained on the SpeechDat-E Slovak database. It is speaker independent, large vocabulary, continuous speech real-time automatic speech recognition system. Test results are given for the t...

متن کامل

Speech recognition using an enhanced FVQ based on a codeword dependent distribution normalization and codeword weighting by fuzzy objective function

The paper presents a new variant of parameter estimation methods for discrete hidden Markov models(HMM) in speech recognition. This method makes use of a codeword dependent distribution normalization(CDDN) and a distance weighting by fuzzy contribution in dealing with the problems of robust state modeling in a FVQ based modeling. The proposed method is compared with the existing techniques usin...

متن کامل

Vergina: A Modern Greek Speech Database for Speech Synthesis

The present paper outlines the Vergina speech database, which was developed in support of research and development of corpus-based unit selection and statistical parametric speech synthesis systems for Modern Greek language. In the following, we describe the design, development and implementation of the recording campaign, as well as the annotation of the database. Specifically, a text corpus o...

متن کامل

Automatic recognition of Korean broadcast news speech

This paper describes preliminary results of automatic recognition of Korean broadcast-news speech. We have been working on flexible vocabulary isolated-word speech recognition, and the same HMM models are used for broadcast-news continuous speech recognition. The recognizer is trained by using phonetically balanced isolated words speech, rather than the broadcast news speech itself. In this res...

متن کامل

Factorial HMMs for acoustic modeling

Despite the success of hidden Markov models (HMMs) and other techniques for speech recognition, there remains a wide perception in the speech research community that new ideas are needed to continue improvements in performance. This paper represents a contribution to this effort. We describe preliminary experiments using an alternative modeling approach known as factorial hidden Markov models (...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2013

Development of Filipino Phonetically - Balanced Words and Test Using Hidden Markov Model

نویسندگان

چکیده

منابع مشابه

Development of a Real-time Asr System for Slovak Speechdat Database

Speech recognition using an enhanced FVQ based on a codeword dependent distribution normalization and codeword weighting by fuzzy objective function

Vergina: A Modern Greek Speech Database for Speech Synthesis

Automatic recognition of Korean broadcast news speech

Factorial HMMs for acoustic modeling

عنوان ژورنال:

اشتراک گذاری